RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 10/030,658

Source: , ffHjJO * 

Date Processed by STIC: 03/02/2004

* THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03: TELEPHONE: 571-272-2510; FAX: 571-273-0221



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

http://www.uspto.gov/web/offices/pac/checker/chkr41note.htm



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there
is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 4B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202



BIOTECHNOLOGY ^ CT S 




Revised 10/0S/03 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
    prevent "wrapping."

Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.

Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters, instead.

Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.

Variable Length
    Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.

PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s). Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

Skipped Sequences (OLD RULES)
    Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

Skipped Sequences (NEW RULES)
    Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence.

    <210> sequence id number
    <400> sequence id number
    000

Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's have been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
    is Artificial Sequence.

Use of <220>
    Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

Misuse of n/Xaa
    "n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid.
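Three of the conditions in the error summary above (line length, tab codes, and non-ASCII text) are purely mechanical and can be screened before a file is submitted. The following is a minimal sketch only, not the Checker program itself; the function name `check_lines` and the wording of the messages are assumptions made for illustration.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; white space counts


def check_lines(text):
    """Return (line_number, problem) tuples for a sequence listing given as text.

    Hypothetical helper: it only screens for over-long lines, tab
    characters, and non-ASCII characters, nothing else.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line longer than 72 characters"))
        if "\t" in line:
            problems.append((number, "tab character present"))
        if not line.isascii():
            problems.append((number, "non-ASCII character present"))
    return problems


if __name__ == "__main__":
    sample = "<210> 1\n" + "a" * 73 + "\n1\t5\n"
    for number, problem in check_lines(sample):
        print(number, problem)
```

A file that returns an empty list here may still fail the other checks in the summary, which depend on the content of the numeric identifiers.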



AMC - Biotechnology Systems Branch - 09/09/2003 



RAW SEQUENCE LISTING DATE : 03/02/2004 

PATENT APPLICATION: US/10/030,658 TIME: 08:47:46

Input Set : A:\2004-01-22 4456-0101P.st25.txt
Output Set: N:\CRF4\03022004\J030658.raw 

3 <110> APPLICANT: Yamamura Ken-ichi 

4 Araki Kimi 

6 <120> TITLE OF INVENTION: TRAP VECTORS AND GENE TRAPPING USING THE SAME 
8 <130> FILE REFERENCE: 4456-0101P

10 <140> CURRENT APPLICATION NUMBER: 10/030,658

11 <141> CURRENT FILING DATE: 2002-01-11 

13 <150> PRIOR APPLICATION NUMBER: JP99/200997 

14 <151> PRIOR FILING DATE: 1999-07-14 
16 <160> NUMBER OF SEQ ID NOS : 14 

18 <170> SOFTWARE: PatentIn Ver. 2.0

20 <210> SEQ ID NO: 1

21 <211> LENGTH: 13

22 <212> TYPE: DNA

23 <213> ORGANISM: Artificial Sequence
25 <220> FEATURE:

26 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

28 <400> SEQUENCE: 1

29 taccgttcgt ata 13

31 <210> SEQ ID NO: 2

32 <211> LENGTH: 13 

33 <212> TYPE: DNA 

34 <213> ORGANISM: Artificial Sequence 
36 <220> FEATURE: 

37 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

39 <400> SEQUENCE: 2

40 tatacgaacg gta 13

42 <210> SEQ ID NO: 3

43 <211> LENGTH: 34 

44 <212> TYPE: DNA 

45 <213> ORGANISM: Artificial Sequence 
47 <220> FEATURE:

48 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

50 <400> SEQUENCE: 3

51 ataacttcgt atagcataca ttatacgaag ttat 34

53 <210> SEQ ID NO: 4 

54 <211> LENGTH: 13 

55 <212> TYPE: DNA 

56 <213> ORGANISM: Artificial Sequence 

58 <220> FEATURE: 

59 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

61 <400> SEQUENCE: 4

62 ataacttcgt ata 

64 <210> SEQ ID NO: 5 




65 <211> LENGTH: 13 

66 <212> TYPE: DNA 

67 <213> ORGANISM: Artificial Sequence 
69 <220> FEATURE: 



70 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

72 <400> SEQUENCE: 5

73 tatacgaagt tat 13

75 <210> SEQ ID NO: 6

76 <211> LENGTH: 34 

77 <212> TYPE: DNA 

78 <213> ORGANISM: 

80 <220> FEATURE: 

81 <223> OTHER INFORMATION: Homologous recombination sequence 
83 <400> SEQUENCE: 6

84 taccgttcgt atagcataca ttatacgaac ggta 34

86 <210> SEQ ID NO: 7

87 <211> LENGTH: 19 

88 <212> TYPE: DNA 

89 <213> ORGANISM: Artificial Sequence

91 <220> FEATURE:

92 <223> OTHER INFORMATION: 21 forward primer used in PCR for B-geo detection

94 <400> SEQUENCE: 7

95 gcgttaccca acttaatcg 

97 <210> SEQ ID NO: 8 

98 <211> LENGTH: 18 

99 <212> TYPE: DNA 

100 <213> ORGANISM: Artificial Sequence 
102 <220> FEATURE:

103 <223> OTHER INFORMATION: 22 reverse primer used in PCR for B-geo detection

105 <400> SEQUENCE: 8

106 tgtgagcgag taacaacc 

108 <210> SEQ ID NO: 9 

109 <211> LENGTH: 22 

110 <212> TYPE: DNA 

111 <213> ORGANISM: Artificial Sequence 
113 <220> FEATURE:

114 <223> OTHER INFORMATION: Ori2 forward primer used in PCR for detecting the replication

115 origin region in pUC vector

117 <400> SEQUENCE: 9

118 gccagtggcg ataagtcgtg tc 22

120 <210> SEQ ID NO: 10

121 <211> LENGTH: 21 

122 <212> TYPE: DNA 

123 <213> ORGANISM: Artificial Sequence 
125 <220> FEATURE: 

126 <223> OTHER INFORMATION: Ori3 reverse primer used in PCR for detecting the replication

127 origin region in pUC vector

129 <400> SEQUENCE: 10

130 cacagaatca ggggataacg c 21

132 <210> SEQ ID NO: 11 

133 <211> LENGTH: 400 

134 <212> TYPE: DNA 

135 <213> ORGANISM: Mus musculus 

137 <220> FEATURE: 

138 <221> NAME/KEY: misc_feature

139 <222> LOCATION: (36)..(36)

140 <223> OTHER INFORMATION: n is a, c, g, or t

142 <220> FEATURE:

143 <221> NAME/KEY: misc_feature

144 <222> LOCATION: (70)..(70)

145 <223> OTHER INFORMATION: n is a, c, g, or t

147 <220> FEATURE:

148 <221> NAME/KEY: misc_feature

149 <222> LOCATION: (362)..(362)

150 <223> OTHER INFORMATION: n is a, c, g, or t

152 <220> FEATURE:

153 <221> NAME/KEY: misc_feature

154 <222> LOCATION: (364)..(364)

155 <223> OTHER INFORMATION: n is a, c, g, or t

157 <220> FEATURE:

158 <221> NAME/KEY: misc_feature

159 <222> LOCATION: (377)..(377)

160 <223> OTHER INFORMATION: n is a, c, g, or t
163 <400> SEQUENCE: 11

W--> 164 agaaacttaa acagcggata aacttcagtg atttanatca gagaagtatt ggaagtgatt 60

165 ctcaaggtan agcaacagcg gctaacaaca aacgtcagct tagtgaaaac cgaaagccct 120 

166 tcaacttttt gcctatgcag attaatacta acaagagcaa ggatgctact gcaagtcttc 180 

167 caaagagaga gatgacaacg tcagcacagt gcaaagagtt gtttgcttct gctctaagta 240 

168 atgacctttt gcaaaactgt caatctctga agaagatggg agaggggagc ctgcatggga 300 

169 aacaccagat tgtaagcagg cttgttcaat cctgactata ttactaaagc tagttctatg 360 

170 cnanaagttt tgtaaanaaa atgaaagtct gcaatgttga 400

172 <210> SEQ ID NO: 12 

173 <211> LENGTH: 416 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Mus musculus 

177 <220> FEATURE: 

178 <221> NAME/KEY: misc_feature

179 <222> LOCATION: (37)..(37)

180 <223> OTHER INFORMATION: n is a, c, g, or t

182 <220> FEATURE:

183 <221> NAME/KEY: misc_feature

184 <222> LOCATION: (363)..(363)

185 <223> OTHER INFORMATION: n is a, c, g, or t

187 <220> FEATURE:

188 <221> NAME/KEY: misc_feature

189 <222> LOCATION: (392)..(392)

190 <223> OTHER INFORMATION: n is a, c, g, or t
192 <220> FEATURE:

193 <221> NAME/KEY: misc_feature

194 <222> LOCATION: (401)..(401)

195 <223> OTHER INFORMATION: n is a, c, g, or t

197 <220> FEATURE:

198 <221> NAME/KEY: misc_feature

199 <222> LOCATION: (403)..(403)

200 <223> OTHER INFORMATION: n is a, c, g, or t
203 <400> SEQUENCE: 12

W--> 204 tcttctagct ttgcagcata aagcagagca agctatnagc tgtgatggat gactctgttg 60

205 ttacagaaac tacaggaagc ttatctggag tcagcatcac atctgaacta aatgaagaac 120 

206 tgaatgattt aattcagcgt ttccataatc agcttcgtga ttctcagcct ccagctgttc 180 

207 cagacaacag aagacaggca gaaagtcttt cattaactag agagatttct cagagcagaa 240 

208 atccctcagt ttctgaacat ttacctgatg agaaagtaca gctttttagc aaaatgagag 300 

209 tactacagga aaagaacaag aaatggacaa attagttggg agaacttcat aaccttcgag 360 

210 atnagcatct gaacaactca tcatttgtgc cntcaacttc ncnccaaaga agtggg 416

212 <210> SEQ ID NO: 13 

213 <211> LENGTH: 484 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Mus musculus

217 <220> FEATURE:

218 <221> NAME/KEY: misc_feature

219 <222> LOCATION: (33)..(33)

220 <223> OTHER INFORMATION: n is a, c, g, or t

222 <220> FEATURE:

223 <221> NAME/KEY: misc_feature

224 <222> LOCATION: (48)..(48)

225 <223> OTHER INFORMATION: n is a, c, g, or t

227 <220> FEATURE:

228 <221> NAME/KEY: misc_feature

229 <222> LOCATION: (54)..(54)

230 <223> OTHER INFORMATION: n is a, c, g, or t

232 <220> FEATURE:

233 <221> NAME/KEY: misc_feature

234 <222> LOCATION: (89)..(89)

235 <223> OTHER INFORMATION: n is a, c, g, or t

237 <220> FEATURE:

238 <221> NAME/KEY: misc_feature

239 <222> LOCATION: (244)..(244)

240 <223> OTHER INFORMATION: n is a, c, g, or t

242 <220> FEATURE:

243 <221> NAME/KEY: misc_feature

244 <222> LOCATION: (257)..(257)

245 <223> OTHER INFORMATION: n is a, c, g, or t
248 <400> SEQUENCE: 13

W--> 249 gtttctacac ctactgaaca gcagcagcca ttnagctcaa aatccttnca gggnaaaaca 60

250 gagtatatgg cttttccaaa accctctgna aagcagttct tctcttggag cagaaaagca 120 

251 aaggaatcaa gaaacagccc gaagaggaag ctgaaaacac taagacacca tggttatatg 180 

252 atcaagaagg tggagtagaa aaaccatttt tcaagactgg atttacagag tctgtagaga 240 

253 aagntacaaa atagtanccg caaaaatcaa ccagatacaa gcaggagaag acgtcggttt 300 

254 gatgaagaat cccttggaaa gctttagcag tatgcctgat cctatagacc caacatcagt 360 

255 aactaaaaca tttaaaacaa gaaaagcatc tgcccaggcc agcctggcct ctaaggacaa 420 

256 aactcccaaa tcaaagagta agaagaggat tctactcagc tgaaaagtag agttaaaaat 480 

257 attg 

260 <210> SEQ ID NO: 14 

261 <211> LENGTH: 211 

262 <212> TYPE: DNA 

263 <213> ORGANISM: Mus musculus 

265 <400> SEQUENCE: 14

266 ctgtctgtca ttgtcgttct cctttagaag gcagaaaaga aatgggaaga aaaaaggcaa 60 

267 aatctggaac actataacgg aaaggagttc gagaagctcc tggaggaagc tcaggccaac 120 

268 atcatgaagt caattccaaa cctggagatg cccccagctt ccagcccagt gtcaaaggga 180 

269 gatgcggcag gggataagct ggagctgtca g 211 
RAW SEQUENCE LISTING ERROR SUMMARY DATE: 03/02/2004

PATENT APPLICATION: US/10/030,658 TIME: 08:47:47 

Input Set : A:\2004-01-22 4456-0101P.st25.txt 
Output Set: N:\CRF4\03022004\J030658.raw 



Please Note : 



Use of n and/or Xaa have been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:11; N Pos. 36,70,362,364,377
Seq#:12; N Pos. 37,363,392,401,403 
Seq#:13; N Pos. 33,48,54,89,244,257 
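
The N positions listed above can be cross-checked against the <222> LOCATION entries by counting residues in the sequence text. The following is a minimal sketch under stated assumptions: the helper names `n_positions` and `unexplained_positions` are hypothetical, positions are 1-based as in the listing, and only single-residue locations are handled. The sample row is the first 60 residues of SEQ ID NO: 11 as printed in the raw listing.

```python
def n_positions(sequence):
    """Return the 1-based positions of every 'n' in a nucleotide sequence.

    White space (the gaps between blocks of ten residues) is ignored.
    """
    residues = "".join(sequence.split()).lower()
    return [i for i, base in enumerate(residues, start=1) if base == "n"]


def unexplained_positions(sequence, declared_locations):
    """Return the 'n' positions that have no matching declared location."""
    declared = set(declared_locations)
    return [p for p in n_positions(sequence) if p not in declared]


if __name__ == "__main__":
    # Listing line 164: residues 1-60 of SEQ ID NO: 11.
    first_row = ("agaaacttaa acagcggata aacttcagtg "
                 "atttanatca gagaagtatt ggaagtgatt")
    print(n_positions(first_row))                 # position of the single n
    print(unexplained_positions(first_row, [36])) # empty when (36)..(36) is declared
```

An empty result from `unexplained_positions` means only that every n has a declared location; whether the <223> text adequately explains the residue still needs to be reviewed by hand.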
VERIFICATION SUMMARY DATE: 03/02/2004

PATENT APPLICATION: US/10/030,658 TIME: 08:47:47

Input Set : A:\2004-01-22 4456-0101P.st25.txt
Output Set: N:\CRF4\03022004\J030658.raw 



L:164 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 after pos.:0
M:341 Repeated in SeqNo=11

L:204 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12 after pos.:0
M:341 Repeated in SeqNo=12

L:249 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:0
M:341 Repeated in SeqNo=13 